LOCUS       pDONR™/Zeo
DISCLAIMER  Certain terms are trademarks or registered trademarks of
            Invitrogen Corporation. See "Intellectual Property" in the Help
            file for more information.
FEATURES             Location/Qualifiers
     misc_feature    complement(268..295)
                     /note="rrnB T2 transcription termination sequence (c)"
     misc_feature    complement(427..470)
                     /note="rrnB T1 transcription termination sequence (c)"
     primer_bind     537..552
                     /note="M13 Forward (-20) priming site"
     misc_recomb     570..668
                     /label=attL1
                     /note="attL1"
     misc_recomb     complement(1562..1658)
                     /label=attL2
                     /note="attL2 (c)"
     misc_signal     complement(1673..1692)
                     /note="T7 Promoter/priming site (c)"
     primer_bind     1700..1716
                     /note="M13 Reverse priming site"
     gene            1829..2638
                     /note="Kanamycin resistance gene"
     rep_origin      2759..3432
                     /note="pUC origin"
     vector          join(1570..3435,1..651)
                     /source="pDONR™221"
                     /type="Donor Vector"
     misc_feature    669..691
                     /note="TEV site"
     source          692..1555
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:71699 IMAGE:5261530"
                     /tissue_type="Brain, hippocampus"
                     /clone_lib="NIH_MGC_95"
                     /lab_host="DH10B"
                     /note="Vector: pBluescriptR"
     gene            692..1555
                     /gene="HS3ST1"
                     /gene_synonym=3OST
                     /gene_synonym=3OST1
                     /db_xref="GeneID:9957"
                     /db_xref="HGNC:5194"
                     /db_xref="MIM:603244"
     CDS             692..1555
                     /dnas_title="heparan sulfate (glucosamine)
                     3-O-sulfotransferase 1"
                     /gene="HS3ST1"
                     /gene_synonym=3OST
                     /gene_synonym=3OST1
                     /codon_start=1
                     /product="heparan sulfate (glucosamine)
                     3-O-sulfotransferase 1"
                     /protein_id="AAH57803.1"
                     /db_xref="GI:34785943"
                     /db_xref="GeneID:9957"
                     /db_xref="HGNC:5194"
                     /db_xref="MIM:603244"
                     /translation="MAALLLGAVLLVAQPQLVPSRTAELGQQELLRKAGTLQDDVRDG
                     VAPNGSAQQLPQTIIIGVRKGGTRALLEMLSLHPDVAAAENEVHFFDWEEHYSHGLGW
                     YLSQMPFSWPHQLTVEKTPAYFTSPKVPERVYSMNPSIRLLLILRDPSERVLSDYTQV
                     FYNHMQKHKPYPSIEEFLVRDGRLNVDYKALNRSLYHVHMQNWLRFFPLRHIHIVDGD
                     RLIRDPFPEIQKVERFLKLSPQINASNFYFNKTKGFYCLRDSGRDRCLHESKGRAHPQ
                     VDPKLLNKLHEYFHEPNKKFFELVGRTFDWH"
     misc_difference 695..695
                     /gene="HS3ST1"
                     /gene_synonym=3OST
                     /gene_synonym=3OST1
                     /note="'A' in cDNA is 'C' in the human genome; amino acid
                     difference: 'T' in cDNA, 'P' in the human genome. The
                     chimpanzee genome agrees with the human genomic sequence
                     and not the cDNA."
     misc_feature    692..1555
                     /note="HS3ST1 coding region"
ORIGIN
        1 CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCTTTG AGTGAGCTGA
       61 TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA
      121 GCGCCCAATA CGCAAACCGC CTCTCCCCGC GCGTTGGCCG ATTCATTAAT GCAGCTGGCA
      181 CGACAGGTTT CCCGACTGGA AAGCGGGCAG TGAGCGCAAC GCAATTAATA CGCGTACCGC
      241 TAGCCAGGAA GAGTTTGTAG AAACGCAAAA AGGCCATCCG TCAGGATGGC CTTCTGCTTA
      301 GTTTGATGCC TGGCAGTTTA TGGCGGGCGT CCTGCCCGCC ACCCTCCGGG CCGTTGCTTC
      361 ACAACGTTCA AATCCGCTCC CGGCGGATTT GTCCTACTCA GGAGAGCGTT CACCGACAAA
      421 CAACAGATAA AACGAAAGGC CCAGTCTTCC GACTGAGCCT TTCGTTTTAT TTGATGCCTG
      481 GCAGTTCCCT ACTCTCGCGT TAACGCTAGC ATGGATGTTT TCCCAGTCAC GACGTTGTAA
      541 AACGACGGCC AGTCTTAAGC TCGGGCCCCA AATAATGATT TTATTTTGAC TGATAGTGAC
      601 CTGTTCGTTG CAACACATTG ATGAGCAATG CTTTTTTATA ATGCCAACTT TGTACAAAAA
      661 AGCAGGCTct gaaaacttgt actttcaagg ccgcaccgcc gagctaggcc agcaggagct
      721 tctgcggaaa gcggggaccc tccaggatga cgtccgcgat ggcgtggccc caaacggctc
      781 tgcccagcag ttgccgcaga ccatcatcat cggcgtgcgc aagggcggca cgcgcgcact
      841 gctggagatg ctcagcctgc accccgacgt ggcggccgcg gagaacgagg tccacttctt
      901 cgactgggag gagcattaca gccacggctt gggctggtac ctcagccaga tgcccttctc
      961 ctggccacac cagctcacag tggagaagac ccccgcgtat ttcacgtcgc ccaaagtgcc
     1021 tgagcgagtc tacagcatga acccgtccat ccggctgctg ctcatcctgc gagacccgtc
     1081 ggagcgcgtg ctatctgact acacccaagt gttctacaac cacatgcaga agcacaagcc
     1141 ctacccgtcc atcgaggagt tcctggtgcg cgatggcagg ctcaatgtgg actacaaggc
     1201 cctcaaccgc agcctctacc acgtgcacat gcagaactgg ctgcgctttt tcccgctgcg
     1261 ccacatccac attgtggacg gcgaccgcct catcagggac cccttccctg agatccaaaa
     1321 ggtcgagagg ttcctaaagc tgtcgccgca gatcaatgct tcgaacttct actttaacaa
     1381 aaccaagggc ttttactgcc tgcgggacag cggccgggac cgctgcttac atgagtccaa
     1441 aggccgggcg cacccccaag tcgatcccaa actactcaat aaactgcacg aatattttca
     1501 tgagccaaat aagaagttct tcgagcttgt tggcagaaca tttgactggc actgaTAGGA
     1561 CCCAGCTTTC TTGTACAAAG TTGGCATTAT AAGAAAGCAT TGCTTATCAA TTTGTTGCAA
     1621 CGAACAGGTC ACTATCAGTC AAAATAAAAT CATTATTTGC CATCCAGCTG ATATCCCCTA
     1681 TAGTGAGTCG TATTACATGG TCATAGCTGT TTCCTGGCAG CTCTGGCCCG TGTCTCAAAA
     1741 TCTCTGATGT TACATTGCAC AAGATAAAAT AATATCATCA TGAACAATAA AACTGTCTGC
     1801 TTACATAAAC AGTAATACAA GGGGTGTTAT GAGCCATATT CAACGGGAAA CGTCGAGGCC
     1861 GCGATTAAAT TCCAACATGG ATGCTGATTT ATATGGGTAT AAATGGGCTC GCGATAATGT
     1921 CGGGCAATCA GGTGCGACAA TCTATCGCTT GTATGGGAAG CCCGATGCGC CAGAGTTGTT
     1981 TCTGAAACAT GGCAAAGGTA GCGTTGCCAA TGATGTTACA GATGAGATGG TCAGACTAAA
     2041 CTGGCTGACG GAATTTATGC CTCTTCCGAC CATCAAGCAT TTTATCCGTA CTCCTGATGA
     2101 TGCATGGTTA CTCACCACTG CGATCCCCGG AAAAACAGCA TTCCAGGTAT TAGAAGAATA
     2161 TCCTGATTCA GGTGAAAATA TTGTTGATGC GCTGGCAGTG TTCCTGCGCC GGTTGCATTC
     2221 GATTCCTGTT TGTAATTGTC CTTTTAACAG CGATCGCGTA TTTCGTCTCG CTCAGGCGCA
     2281 ATCACGAATG AATAACGGTT TGGTTGATGC GAGTGATTTT GATGACGAGC GTAATGGCTG
     2341 GCCTGTTGAA CAAGTCTGGA AAGAAATGCA TAAACTTTTG CCATTCTCAC CGGATTCAGT
     2401 CGTCACTCAT GGTGATTTCT CACTTGATAA CCTTATTTTT GACGAGGGGA AATTAATAGG
     2461 TTGTATTGAT GTTGGACGAG TCGGAATCGC AGACCGATAC CAGGATCTTG CCATCCTATG
     2521 GAACTGCCTC GGTGAGTTTT CTCCTTCATT ACAGAAACGG CTTTTTCAAA AATATGGTAT
     2581 TGATAATCCT GATATGAATA AATTGCAGTT TCATTTGATG CTCGATGAGT TTTTCTAATC
     2641 AGAATTGGTT AATTGGTTGT AACACTGGCA GAGCATTACG CTGACTTGAC GGGACGGCGC
     2701 AAGCTCATGA CCAAAATCCC TTAACGTGAG TTACGCGTCG TTCCACTGAG CGTCAGACCC
     2761 CGTAGAAAAG ATCAAAGGAT CTTCTTGAGA TCCTTTTTTT CTGCGCGTAA TCTGCTGCTT
     2821 GCAAACAAAA AAACCACCGC TACCAGCGGT GGTTTGTTTG CCGGATCAAG AGCTACCAAC
     2881 TCTTTTTCCG AAGGTAACTG GCTTCAGCAG AGCGCAGATA CCAAATACTG TTCTTCTAGT
     2941 GTAGCCGTAG TTAGGCCACC ACTTCAAGAA CTCTGTAGCA CCGCCTACAT ACCTCGCTCT
     3001 GCTAATCCTG TTACCAGTGG CTGCTGCCAG TGGCGATAAG TCGTGTCTTA CCGGGTTGGA
     3061 CTCAAGACGA TAGTTACCGG ATAAGGCGCA GCGGTCGGGC TGAACGGGGG GTTCGTGCAC
     3121 ACAGCCCAGC TTGGAGCGAA CGACCTACAC CGAACTGAGA TACCTACAGC GTGAGCTATG
     3181 AGAAAGCGCC ACGCTTCCCG AAGGGAGAAA GGCGGACAGG TATCCGGTAA GCGGCAGGGT
     3241 CGGAACAGGA GAGCGCACGA GGGAGCTTCC AGGGGGAAAC GCCTGGTATC TTTATAGTCC
     3301 TGTCGGGTTT CGCCACCTCT GACTTGAGCG TCGATTTTTG TGATGCTCGT CAGGGGGGCG
     3361 GAGCCTATGG AAAAACGCCA GCAACGCGGC CTTTTTACGG TTCCTGGCCT TTTGCTGGCC
     3421 TTTTGCTCAC ATGTT
//